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<120> 

<130> 
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<141> 
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<150> 
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<160> 
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<210> 
<211> 
<212> 
<213> 
<220> 
<223> 
<221> 
<222> 
<400> 
a gat 
Asp 
1 



APPLICANT: Braun, Jonathan 
Sutton, Christopher L. 

TITLE OF INVENTION: IBD- Associated Microbial Nucleic Acid 
Molecules 

FILE REFERENCE: P-PM 4966 

CURRENT APPLICATION NUMBER: US/09/966 , 608 
CURRENT FILING DATE: 2001-09-27 

PRIOR APPLICATION NUMBER: US 09/303,120 

PRIOR FILING DATE: 1999-04-30 

PRIOR APPLICATION NUMBER: US 09/820,576 

PRIOR FILING DATE: 2001-03-28 

NUMBER OF SEQ ID NOS : 10 

SOFTWARE: FastSEQ for Windows Version 4.0 

SEQ ID NO: 1 

LENGTH: 302 

TYPE: DNA 

ORGANISM: Unknown 

FEATURE : 

OTHER INFORMATION: Microbial Organism from the human gut 



ENTERED 



NAME/KEY 
LOCATION 
SEQUENCE 
ctg gcc 



CDS 

(2) . . . (301) 
1 

age gcc gtg ggc 



ate cag tec ggc age ate 



Leu Ala Ser Ala Val Gly lie Gin Ser Gly Ser lie 
5 10 



ttt 
Phe 



cat cac 
His His 
15 



49 



ttc 


aag 


age 


aag 


gat 


gag 


ata 


ttg 


cgt 


gcc 


gtg 


atg 


gag 


gaa 


acc 


ate 


97 


Phe 


Lys 


Ser 


Lys 
20 


Asp 


Glu 


He 


Leu 


Arg 
25 


Ala 


Val 


Met 


Glu 


Glu 
30 


Thr 


He 




cat 


tac 


aac 


ace 


gcg 


atg 


atg 


cgc 


get 


tea 


ctg 


gag 


gag 


gcg 


age 


acg 


145 


His 


Tyr 


Asn 
35 


Thr 


Ala 


Met 


Met 


Arg 
40 


Ala 


Ser 


Leu 


Glu 


Glu 
45 


Ala 


Ser 


Thr 




gtg 


cgc 


gaa 


cgc 


gtg 


ctg 


gcg 


ctg 


ate 


cgc 


tgc 


gag 


ttg 


cag 


teg 


ate 


193 


Val 


Arg 
50 


Glu 


Arg 


Val 


Leu 


Ala 
55 


Leu 


He 


Arg 


Cys 


Glu 
60 


Leu 


Gin 


Ser 


He 




atg 


ggc 


ggc 


agt 


ggc 


gag 


gcc 


atg 


gcg 


gtg 


ctg 


gtc 


tac 


gaa 


tgg 


cgc 


241 


Met 


Gly 


Gly 


Ser 


Gly 


Glu 


Ala 


Met 


Ala 


Val 


Leu 


Val 


Tyr 


Glu 


Trp 


Arg 




65 










70 










75 










80 




teg 


ctg 


teg 


gcc 


gaa 


ggc 


cag 


gcg 


cac 


gtg 


ctg 


gcc 


ctg 


cgt 


gac 


gtg 


289 


Ser 


Leu 


Ser 


Ala 


Glu 
85 


Gly 


Gin 


Ala 


His 


Val 
90 


Leu 


Ala 


Leu 


Arg 


Asp 
95 


Val 




tat 


gag 


cag 


ate 


t 
























302 


Tyr 


Glu 


Gin 


He 





























100 

<210> SEQ ID NO: 2 
<211> LENGTH: 100 
<212> TYPE: PRT 
<213> ORGANISM: Unknown 
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68 <220> FEATURE: 

69 <223> OTHER INFORMATION: Microbial organism from the human gut 

71 <400> SEQUENCE: 2 

72 Asp Leu Ala Ser Ala Val Gly lie Gin Ser Gly Ser lie Phe His His 

73 1 5 10 15 

74 Phe Lys Ser Lys Asp Glu lie Leu Arg Ala Val Met Glu Glu Thr lie 

75 20 25 30 

76 His Tyr Asn Thr Ala Met Met Arg Ala Ser Leu Glu Glu Ala Ser Thr 

77 35 40 45 

78 Val Arg Glu Arg Val Leu Ala Leu lie Arg Cys Glu Leu Gin Ser lie 

79 50 55 60 

80 Met Gly Gly Ser Gly Glu Ala Met Ala Val Leu Val Tyr Glu Trp Arg 

81 65 70 75 80 

82 Ser Leu Ser Ala Glu Gly Gin Ala His Val Leu Ala Leu Arg Asp Val 

83 85 90 95 

84 Tyr Glu Gin lie 

85 100 

88 <210> SEQ ID NO: 3 

89 <211> LENGTH: 392 

90 <212> TYPE: DNA 

91 <213> ORGANISM: Unknown 

93 <220> FEATURE: 

94 <223> OTHER INFORMATION: Microbial Organism from the human gut 

96 <221> NAME/KEY: CDS 

97 <222> LOCATION: (2)... (346) 

99 <221> NAME/KEY: misc_feature 

100 <222> LOCATION: (1)...(392) 

101 <223> OTHER INFORMATION: n = A,T,C or G 

103 <400> SEQUENCE: 3 

104 a gat ctt gag cgt cat gag tgc ctg ggg tac gcc ttt tea teg cgt ccg 4 9 

105 Asp Leu Glu Arg His Glu Cys Leu Gly Tyr Ala Phe Ser Ser Arg Pro 



106 




1 






i 


5 








10 








15 




108 


gcg 


gat 


cga 


gag 


tgg 


gtg 


ttt 


ttt 


cag 


ggc 


acg 


gtt 


tec 


tac 


aag 


gta 


97 


109 


Ala 


Asp 


Arg 


Glu 


Trp 


Val 


Phe 


Phe 


Gin 


Gly 


Thr 


Val 


Ser 


Tyr 


Lys 


Val 




110 








20 










25 










30 








112 


cga 


gtg 


gcc 


age 


cgt 


ttg 


etc 


ate 


aat 


gaa 


age 


egg 


gca 


ttg 


atg 


teg 


145 


113 


Arg 


Val 


Ala 


Ser 


Arg 


Leu 


Leu 


He 


Asn 


Glu 


Ser 


Arg 


Ala 


Leu 


Met 


Ser 




114 






35 










40 










45 










116 


gcg 


gca 


ttg 


gat 


ggt 


ttt 


ggc 


ata 


gtg 


etc 


ggc 


ccg 


caa 


gac 


ttc 


ctg 


193 


117 


Ala 


Ala 


Leu 


Asp 


Gly 


Phe 


Gly 


He 


Val 


Leu 


Gly 


Pro 


Gin 


Asp 


Phe 


Leu 




118 




50 










55 










60 












120 


cga 


acg 


gcg 


ttg 


gcg 


agt 


ggc 


gag 


ttg 


gtg 


egg 


gtg 


ttg 


ccg 


gag 


ttt 


241 


121 


Arg 


Thr 


Ala 


Leu 


Ala 


Ser 


Gly 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Pro 


Glu 


Phe 




122 


65 










70 










75 










80 




124 


gag 


get 


ccg 


agt 


egg 


teg 


atg 


cat 


ttg 


gtc 


tac 


acc 


gca 


aac 


cgc 


cag 


289 


125 


Glu 


Ala 


Pro 


Ser 


Arg 


Ser 


Met 


His 


Leu 


Val 


Tyr 


Thr 


Ala 


Asn 


Arg 


Gin 




126 










85 










90 










95 






128 


cgt 


acc 


gcc 


aag 


ttg 


cgc 


tgc 


ttt 


gtc 


gag 


act 


gtg 


ctg 


gga 


cgt 


ttt 


337 


129 


Arg 


Thr 


Ala 


Lys 


Leu 


Arg 


Cys 


Phe 


Val 


Glu 


Thr 


Val 


Leu 


Gly 


Arg 


Phe 
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1 jU 






100 










105 










110 






W--> 132 


ggt 


ccg gta 


tgaaggagca ccaccgtggc ggtcgccggg angcacctaa 








Gly 


Pro Val 




























1 "3 A 

1.34 




115 




























loo 


agatct 




























TOO 
lib 


<210> SEQ ID NO 


: 4 
























1 OQ 


<211> LENGTH: 115 
























1 A A 

14U 


<212> TYPE: 


PRT 


























141 


<213> ORGANISM: 


Unknown 






















14 j 


<220> FEATURE: 


























1 A A 

144 


<223> OTHER 


INFORMATION 


: Microbial organism 


from the human gut 


14 0 


<400> SEQUENCE : 


4 
























14 / 


Asp 


Leu Glu 


Arg 


His 


Glu 


Cys 


Leu 


Gly 


Tyr 


Ala 


Phe 


Ser 


Ser 


Arg 


Pro 


14 o 


1 






5 










10 










15 




14 y 


Ala 


Asp Arg 


Glu 


Trp 


Val 


Phe 


Phe 


Gin 


Gly 


Thr 


Val 


Ser 


Tyr 


Lys 


Val 


1 K A 
1DU 






20 










25 










30 






1 CI 

iDl 


Arg 


Val Ala 


Ser 


Arg 


Leu 


Leu 


He 


Asn 


Glu 


Ser 


Arg 


Ala 


Leu 


Met 


Ser 


1d2 




35 










40 










45 








Id j 


Ala 


Ala Leu 


Asp 


Gly 


Phe 


Gly 


He 


Val 


Leu 


Gly 


Pro 


Gin 


Asp 


Phe 


Leu 


1d4 




50 








55 










60 










Ijj 


Arg 


Thr Ala 


Leu 


Ala 


Ser 


Gly 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Pro 


Glu 


Phe 


1DO 


65 








70 










75 










80 


ID / 


Glu 


Ala Pro 


Ser 


Arg 


Ser 


Met 


His 


Leu 


Val 


Tyr 


Thr 


Ala 


Asn 


Arg 


Gin 


15o 








85 










90 










95 




ioy 


Arg 


Thr Ala 


Lys 


Leu 


Arg 


Cys 


Phe 


Val 


Glu 


Thr 


Val 


Leu 


Gly 


Arg 


Phe 


1 £A 
10U 






100 










105 










110 






101 


Gly 


Pro Val 




























loz 




115 




























ICC 

loo 


<210> SEQ ID NO: 


: 5 
























loo 


<211> LENGTH: 114 
























10 / 


<212> TYPE: 


PRT 


























loo 


<213> ORGANISM: 


Unknown 






















1 / u 


<220> FEATURE: 


























1/1 


<223> OTHER 


INFORMATION 


: Microbial Organism 


from the human gut 


1 / -5 


<221> NAME/KEY: 


VARIANT 






















1/4 


<222> LOCATION: 


(1). 


. . . (114) 




















J. / J 


<223> OTHER 


INFORMATION; 


: Xaa = Any Amino Acid 










177 


<400> SEQUENCE: 


5 
























178 


Arg 


Thr Arg 


Arg 


He 


Ser 


Leu 


Pro 


His 


Lys 


Lys 


Leu 


Ala 


Arg 


Asn 


Gly 


179 


1 






5 










10 










15 




180 


Val 


Leu Tyr 


Ser 


His 


Gly 


Ala 


Thr 


Gin 


Glu 


Asp 


lie 


Phe 


Ala 


Pro 


Cys 


181 






20 










25 










30 






182 


Gin 


His Arg 


Arg 


Cys 


Gin 


He 


Thr 


Lys 


Ala 


Tyr 


His 


Glu 


Ala 


Arg 


Leu 


183 




35 










40 










45 








184 


Val 


Glu Gin 


Ser 


Arg 


Arg 


Gin 


Arg 


Thr 


Ala 


Leu 


Gin 


His 


Pro 


His 


Gin 


185 




50 








55 










60 










186 


Arg 


Leu Lys 


Leu 


Ser 


Arg 


Thr 


Pro 


Arg 


His 


Met 


Gin 


Asp 


Val 


Gly 


Cys 


187 


65 








70 










75 










80 


188 


Val 


Ala Leu 


Thr 


Gly 


Gly 


Leu 


Gin 


Ala 


Ala 


Lys 


Asp 


Leu 


Ser 


His 


Gin 
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1 ftQ 

X O 27 










85 










90 










95 




190 

X J \J 


Ser 


Thr 


Lys 


Thr 


Arg 


Tyr 


Ser 


Pro 


Ala 


Gly Gly 


His 


Arcr 


Asp 


Glv 


Pro 


1 Q1 








100 










105 










110 








Xaa 


Val 






























X .7 D 


<210> SEQ ID NO 


: 6 
























1 Q7 


<211> LENGTH: 190 
























1 Qft 
x ;? o 


<212> TYPE: 


PRT 


























1 QQ 
_l _7 


<213> ORGANISM: 


Clostridium 


pasteurianum 












o n i 


<4 00> SEQUENCE: 


u 
























Z \J Z 


Mc U 


Asn 


Lys 


JL 11X 


Lys 




Aon 
no 11 


lie 


IT 11C 


iyr 




Ala 


He 


Lys 


Val 


Phe 


90 ^ 

Z \J «J 


1 
X 








J 










1 0 
x \j 










15 




904. 
z u *± 


Ser 


Asn 


Asn 


C 1 \7 

\j xy 


lyr 


As n 


nl \7 


Al * 


T" V» t* 
1 IIX 


Mot 


Asp 


Glu 


He 


Ala 


Ser 


Asn 


90 "=1 








20 










9 ^ 
Z J 










J u 






90fi 

Z \J o 


Ala 


Gly 


Val 


Ala 


J_iy o 




T Vi y* 
1 11! 


JJCU 


iyr 


iyr 


n x o 


Phe 


Lys 


Ser 


Lys 


Glu 


907 






35 










AO 










45 








90 A 
z u o 


Glu 


He 


Phe 


Lys 


Tyr 


Tip 

JL 1c 


T ] A 

jl xe 


ulU 






val 


Acn 


ucu 




Lys 


Asn 


90Q 
z u y 




50 










55 










fift 
U \J 










910 


Glu 


He 


Asp 


Glu 


Ala 


Thr 


Asp 


Lys 


Glu 


Lys 


Thr 




iJCU 


Glu 




Leu 


91 1 

Z JL JL 


65 










70 










75 










80 


919 

Z JL Z 


Lys 


Ala 


Val 


Cys 


Arg 


Val 


Gin 


Leu 


Asn 


Leu 


He 


T VT* 

xyx 


T.vc 
i_<_y o 


noil 


Arg 


rA. o 


■ 91 ^ 










85 










90 














9 1 A 

Z _L *± 


Phe 


Phe 


Lys 


Val 


He 


Ala 


Ser 


Gin 


Leu 


Trp 


Gly 


T,VC 

j-iy o 


Glu 


Leu 


Arg 


Gin 


91 S 

Z X J 








100 










105 










110 






91 
Z X 0 


Leu 


Glu 


Leu 


Arg 


Asp 


lie 


Met 


Arg 


Asn 


Tyr 


Val 


Va 1 
V d J_ 




Tip 


vj X U 


nin 

\J X u. 


917 

Z J. / 






115 










120 










125 








91ft 

Z X o 


Phe 


Val 


Lys 


Asp 


Ala 


Met 


Glu 


Ala 


Gly 


Ser 


He 


LyS 


Lys 


Gly 


Asn 


Ser 


9 1 Q 

Z. X _7 




130 










135 










140 










9 90 
zzu 


Leu 


Phe 


Val 


Ala 


Tyr 


Ala 


Phe 


Leu 


Gly 


Thr 


Leu 




OCX 


v d x 


OCX 


XiC u. 


991 
Z Z JL 


145 










150 










155 










1 fif) 

X \J \J 


999 
Z Z Z 


Tyr 


Glu 


Val 


He 


Asn 


Ala 


Glu 


Asn 


Asp 


Asn 


He 


Acn 
noil 


Aon 

t\j LI 


x nx 


Tip 
X xc 


uXU 


223 










165 










170 










175 




9 94 
z z *± 


Asn 


Leu 


Met 


Asn 


Tyr 


He 


Leu 


Asn 


Gly 


He 


Gly 


Leu 


Gin 


Asn 






99S 

Z Z J 








180 










185 










190 






99ft 


<210> SEQ ID NO; 


: 7 
























99Q 

Z Z 27 


<211> LENGTH: 200 
























9 in 


<212> TYPE: 


PRT 


























231 


<213> ORGANISM: 


Mycobacterium tuberculosis 












233 


<400> SEQUENCE: 


7 
























234 


Met 


Asp 


Arg 


Val 


Ala 


Gly 


Gin 


Val 


Asn 


Ser 


Arg 


Arg 


Gly 


Glu 


Leu 


Leu 


235 


1 








5 










10 










15 




236 


Glu 


Leu 


Ala 


Ala 


Ala 


Met 


Phe 


Ala 


Glu 


Arg 


Gly 


Leu 


Arg 


Ala 


Thr 


Thr 


237 








20 










25 










30 






238 


Val 


Arg 


Asp 


He 


Ala 


Asp 


Gly 


Ala 


Gly 


He 


Leu 


Ser 


Gly 


Ser 


Leu 


Tyr 


239 






35 










40 










45 








240 


His 


His 


Phe 


Ala 


Ser 


Lys 


Glu 


Glu 


Met 


Val 


Asp 


Glu 


Leu 


Leu 


Arg 


Gly 


241 




50 










55 










60 










242 


Phe 


Leu 


Asp 


Trp 


Leu 


Phe 


Ala 


Arg 


Tyr 


Arg 


Asp 


He 


Val 


Asp 


Ser 


Thr 


243 


65 










70 










75 










80 


244 


Ala 


Asn 


Pro 


Leu 


Glu 


Arg 


Leu 


Gin 


Gly 


Leu 


Phe 


Met 


Ala 


Ser 


Phe 


Glu 
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245 






85 










90 










95 




246 


Ala lie Glu 


His 


His 


His 


Ala 


Gin 


Val 


Val 


He 


Tvr 


Gin 


Asp 


Glu 


Ala 


247 




100 










105 










110 






24 8 


Gin Arg Leu 


Ala 


Ser 


Gin 


Pro 


Arg 


Phe 


Ser 


Tvr 


lie 


Glu 


Asp 


Arg 


Asn 


24 9 


115 










120 










125 








250 


Lys Gin Gin 


Arg 


Lys 


Met 


Trp 


Val 


Asp 


Val 


Leu 


Asn 


Gin 


Glv 


He 


Glu 


251 


130 








135 










140 










252 


Glu Gly Tyr 


Phe 


Arg 


Pro 


Asp 


Leu 


Asp 


Val 


Asp 


Leu 


Val 


Tvr 


Arg 


Phe 


253 


145 






150 










155 










160 


9 


lie Arg Asp 


Thr 


Thr 


Trp 


Val 


Ser 


Val 


Arg 




Tvr 


Arg 


Pro 


Glv 


Glv 


255 






165 










170 










175 




9 Sfi 

Z JO 


Pro Leu Thr 


Ala 


Gin 


Gin 


Val 


Gly 


OJ.ll 








nlU 


Tip 


Val 


Leu 


9 S7 




180 










185 

_L O ~i 










190 






9 Sfl 

Z J o 


Gly Gly He 


Thr 


Lys 


Glu Gly Val 


















9 SQ 
z Dy 


195 










200 


















9 fi9 
z u z 


<210> SEQ ID NO: 


: 8 
























9 

Z U J 


<211> LENGTH: 192 
























9 A 


<212> TYPE: 


PRT 


























9 S 

Z D J 


<213> ORGANISM: 


Auifex aeolicus 


















9 67 
Z\j f 


<4 00> SEQUENCE: 


8 
























9 ft 
zoo 


Met Tyr He 


Leu 


Leu 


Phe 


Met 


Gly 


VJJ 1 Li. 


T.VQ 


Ar g 


C -p 




Thr 


T.VQ 


Glu 


9fi Q 

* U y 


1 




5 










1 0 










1 5 
i ~j 




970 
z / \J 


Lys He Leu 


Ser 


Ser 


Ala 


Leu 


Lys 






Ser 


Lys 


Lys 


Gly 


Phe 


Lys 


971 
Z / -L 




20 










9 ^ 

Z ~j 










^n 






979 
z / z 


Glu Thr Thr 


He 


Lys 


Asp 


He 


Ala 


Lys 


Glu 


Val 


Glv 


He 


Thr 


Glu 


Gly 


97 


35 










40 










** ~j 








974 
z / ** 


Ala He Tyr 


Arg 


His 


Phe 


Thr 


Ser 


T, VQ 


VJJ.U 


Glu 


He 


He 


Lys 


Ser 


Leu 


97 S 

Z f ~J 


50 








55 










fin 










97fi 
z / o 


Leu Glu Ser 


lie 


Thr 


Lys 


Glu 


Leu 


Arg 


His 


Lys 


Leu 


Glu 


Val 


Ala 


Leu 


977 
z / / 


65 






70 










75 










80 


278 


Gin Arg Gly 


Glu 


Thr 


Asp 


Glu 


Glu 


He 


Leu 


Glu 


Ser 


He 


Val 


Asp 


Thr 


97Q 
z / y 






85 




















95 




280 


Leu He Asp 


Tyr 


Ala 


Phe 


Ser 


Asn 


Pro 


Glu 


Ser 


Phe 


Arg 


Phe 


Leu 


Asn 


9 ft 1 

Z O -L 




100 










ins 










110 






9 ft 9 
z o z 


Leu Tyr His 


Leu 


Leu 


Lys 


Glu 


Tyr 




uiU 


Val 


Lys 


Asn 


Leu 


Pro Gly 


9ft ^ 

Z O ~J 


115 










120 










125 








z o 4 


Glu Leu He 


Leu 


Lys 


Phe 


Leu 


Asn 




Leu 


Tyr 


Leu 


Lys 


Arg 


Lys 


Leu 


285 


130 








135 










140 










286 


Lys Thr Tyr 


Pro 


Glu 


He 


Ala 


Leu 


Ala 


Val 


Val 


Thr 


Gly 


Ser 


Val 


Glu 


287 


145 






150 










155 










160 


288 


Arg Val Phe 


He 


Phe 


Lys 


Glu 


Arg 


Asn 


Phe 


Leu 


Asp 


Tyr 


Asp 


Glu 


Glu 
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Thr He Lys 


Lys 


Glu 


Leu 


Lys 


Lys 


Val 


Leu 


Lys 


Ser 


Ala 


He 


Leu 


Ala 
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<210> SEQ ID NO: 


9 
























295 


<211> LENGTH: 18 
























296 


<212> TYPE: 


DNA 


























297 


<213> ORGANISM: 


Unknown 






















299 


<220> FEATURE: 
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VERIFICATION SUMMARY DATE: 10/18/2001 

PATENT APPLICATION: US/09/966,608 TIME: 17:10:17 



Input Set : A:\PM4966.txt 

Output Set: N:\CRF3\10182001\l966608.raw 

L:12 M:270 C: Current Application Number differs, Replaced Current Application No 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:132 M:341 W: (46) "n n or "Xaa" used, for SEQ ID# : 3 
L:192 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 



file://C:\CRF3\Outhold\VsrI966608.htm 



10/18/01 



